Model Selection

UI Element Detection

# UI Element Detection

OmniParser is a universal screen parsing tool capable of interpreting/converting user interface screenshots into structured formats to enhance existing LLM-based UI agents.

Paligemma 3b Ft Waveui 896

A UI element detection model fine-tuned from PaliGemma 3B 896-resolution weights, specializing in object detection tasks

Transformers English

Qwen Vl Guidance

GUIChat is a multimodal model based on Visual Question Answering (VQA), capable of understanding image content and answering related questions, specifically optimized for GUI element recognition and interaction.

Paligemma 3b Ft Widgetcap Waveui 448

A vision-language model fine-tuned for object detection tasks on the WaveUI dataset, based on PaliGemma 3B 448-resolution weights

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase